PyDigger - unearthing stuff about Python


Name Version Summary Date
rdatacompy 0.1.9 Lightning-fast dataframe comparison library built in Rust with Python bindings 2025-10-27 15:32:18
cleaning-agent 0.1.0 Intelligent data cleaning agent for automated data quality improvement 2025-10-15 07:49:25
dql-core 0.5.2 Framework-agnostic validation engine for Data Quality Language (DQL) 2025-10-10 18:28:21
dql-parser 0.5.2 Pure Python parser for Data Quality Language (DQL) 2025-10-10 18:13:39
autocsv-profiler 2.0.0 Automated CSV data analysis with statistical profiling and visualization 2025-10-09 11:50:39
databeak 0.1.2 DataBeak: MCP server for comprehensive CSV file operations with pandas-based tools 2025-10-07 13:20:43
lakehouse-engine 1.27.1 A configuration-driven Spark framework serving as the engine for several lakehouse algorithms and data flows. 2025-10-07 08:22:13
parxyval 0.1.0 An evaluation framework for document parsing. 2025-10-06 10:46:31
syndat 0.13.3 A library for evaluation & visualization of synthetic data. 2025-09-08 11:52:57
validador-cnpj 0.2.1 PySpark UDFs for cleaning, repairing, normalizing, and validating CNPJ numbers (numeric and alphanumeric). 2025-09-04 19:30:34
datacompose 0.2.6.1 Copy-pasteable data transformation primitives for PySpark. Inspired by shadcn-svelte. 2025-08-25 16:54:23
cleanengine 0.1.2 The Ultimate Data Cleaning & Analysis Toolkit 2025-08-24 13:20:31
csv-mcp-server 1.0.0 MCP server for comprehensive CSV file operations with pandas-based tools 2025-08-13 06:53:17
sparkdq 0.11.0 A declarative PySpark framework for row- and aggregate-level data quality validation. 2025-08-09 16:03:40
data-degradation-detector 1.0.5 A part of my TFM/Research project handles data drift 2025-07-21 05:45:24
lawkit-python 2.5.15 Python wrapper for lawkit - Statistical law analysis toolkit for fraud detection and data quality assessment 2025-07-16 16:47:28
diqu 0.2.0 Data Quality CLI for the Auto-Alerts 2024-07-08 03:56:04
diqu-email 1.0.0 Data Quality CLI for the Auto-Alerts - Emails 2024-07-07 04:06:59
pydeequ 1.3.0 PyDeequ - Unit Tests for Data 2024-04-26 20:35:24
compars 0.0.0 DataFrame comparison done right (AKA the Bear-agnostic DataFrame comparison library) 2024-04-20 18:28:36